Robust Inference with Clustered Data
نویسندگان
چکیده
In this paper we survey methods to control for regression model error that is correlated within groups or clusters, but is uncorrelated across groups or clusters. Then failure to control for the clustering can lead to understatement of standard errors and overstatement of statistical signi cance, as emphasized most notably in empirical studies by Moulton (1990) and Bertrand, Du o and Mullainathan (2004). We emphasize OLS estimation with statistical inference based on minimal assumptions regarding the error correlation process. Complications we consider include cluster-speci c xed effects, few clusters, multi-way clustering, more e cient feasible GLS estimation, and adaptation to nonlinear and instrumental variables estimators.
منابع مشابه
Inference for clustered data using the independence log-likelihood
Independence estimating equations (IEEs) are commonly used for the analysis of clustered data. Here, we use the properties of IEEs to adjust the “independence” log-likelihood function in the presence of clustering. The proposed adjustment relies on the robust sandwich estimator of the parameter covariance matrix, which is easy to calculate. We show that the adjusted log-likelihood competes favo...
متن کاملt-Statistic Based Correlation and Heterogeneity Robust Inference
We develop a general approach to robust inference about a scalar parameter when the data is potentially heterogeneous and correlated in a largely unknown way. The key ingredient is the following result of Bakirov and Székely (2005) concerning the small sample properties of the standard t−test: For a significance level of 5% or lower, the t−test remains conservative for underlying observations t...
متن کاملDeveloping a fuzzy inference system to devise proper business strategies: a study on carpet industry
The present article formulates the scenarios that the organization will be probably facing with, using the uncertain factors in business environment, and it also selects the most robust strategies of organization for dealing with the formulated scenarios using the fuzzy information expressed by the experts in fuzzy inference system. The present article aims to provide a method enabling the scen...
متن کاملA Monte Carlo Study to Evaluate the Robust Standard Error Feature in the PHREG procedure
Cox proportional hazards model is the most commonly used regression model to analyze survival data. The regression parameters in a Cox model can be estimated by maximizing the partial likelihood (Cox 1972, 1975). When the association is presented in the data due to matching design, recurrent events or multiple types of failure, the marginal approach can be used to adjust for association. In thi...
متن کاملRobust Potato Color Image Segmentation using Adaptive Fuzzy Inference System
Potato image segmentation is an important part of image-based potato defect detection. This paper presents a robust potato color image segmentation through a combination of a fuzzy rule based system, an image thresholding based on Genetic Algorithm (GA) optimization and morphological operators. The proposed potato color image segmentation is robust against variation of background, distance and ...
متن کاملA Pairwise Likelihood Method for Correlated Binary Data With/without Missing Observations under Generalized Partially Linear Single-index Models
Correlated data, such as multivariate or clustered data, arise commonly in practice. Unlike analysis for independent data, valid inference based on such data often requires proper accommodation of complex association structures among response components within clusters. Semiparametric models based on generalized estimating equations (GEE) methods, and their extensions, have become increasingly ...
متن کامل